RNA BINDING PROTEIN AND BINDING SITE
USEFUL FOR EXPRESSION OF RECOMBINANT MOLECULES

Technical Field

The invention relates to expression systems and methods for 
expression of desired genes and gene products in cells. 
Particularly, the invention relates to a gene encoding an RNA
binding protein useful for regulating gene expression in cells,
the protein binding site, a gene encoding a regulatory protein
disulfide isomerase, and methods and systems for gene expression
of recombinant molecules.



Background

Expression systems for expression of exogenous genes in
eukaryotic and prokaryotic cells are basic components of
recombinant DNA technology. Despite the abundance of
expression systems and their widespread use, they all have
characteristic disadvantages. For example, while expression in
E. coli is probably the most popular because the organism is easy
to grow and is well understood, eukaryotic proteins expressed
therein are not properly modified. Moreover, those proteins tend
to precipitate into insoluble aggregates and are difficult to
obtain in large amounts. Mammalian expression systems, while
practical for small-scale protein production, are more difficult,
time-consuming and expensive than expression in E. coli.

A number of plant expression systems exist as well, as
summarized in US Patent No. 5,234,834, the disclosures of which
are hereby incorporated by reference. One advantage of plants
or algae in an expression system is that they can be used to
produce pharmacologically important proteins and enzymes on a
large scale and in relatively pure form. In addition,
micro-algae have several unique characteristics that make them
ideal organisms for the production of proteins on a large scale.
First, unlike most systems presently used to produce transgenic
proteins, algae can be grown in minimal media (inorganic salts) 
using sunlight as the energy source. These algae can be grown 
in contained fermentation vessels or on large scale in monitored 
ponds. Ponds of up to several acres are routinely used for the 
production of micro-algae. Second, plants and algae have two 
distinct compartments, the cytoplasm and the chloroplast, in
which proteins can be expressed. The cytoplasm of algae is 
similar to that of other eukaryotic organisms used for protein 
expression, like yeast and insect cell cultures. The 
chloroplast is unique to plants and algae and proteins expressed 
in this environment are likely to have properties different from 
those of cytoplasmically expressed proteins. 

The present invention describes an expression system in 
which exogenous molecules are readily expressed in either 
prokaryotic or eukaryotic hosts and in either the cytoplasm or 
chloroplast. These beneficial attributes are based on the
discovery and cloning of components of translation regulation in 
plants as described in the present invention. 

Protein translation plays a key role in the regulation of
gene expression across the spectrum of organisms (Kozak, Ann.
Rev. Cell Biol., 8:197-225 (1992) and de Smit and Van Duin,
Prog. Nucleic Acid Res. Mol. Biol., 38:1-35 (1990)). The
majority of regulatory schemes characterized to date involve
translational repression, often involving proteins binding to
mRNA to limit ribosome association (Winter et al., Proc. Natl.
Acad. Sci. USA, 84:7822-7826 (1987) and Tang and Draper,
Biochem., 29:4434-4439 (1990)). Translational activation has
also been observed (Wulczyn and Kahmann, Cell, 65:259-269
(1991)), but few of the underlying molecular mechanisms for this
type of regulation have been identified. In plants, light
activates the expression of many genes. Light has been shown to
activate expression of specific chloroplast encoded mRNAs by
increasing translation initiation (Mayfield et al., Ann. Rev.
Plant Physiol. Plant Mol. Biol., 46:147-166 (1995) and Yohn et
al., Mol. Cell Biol., 16:3560-3566 (1996)). Genetic evidence in
higher plants and algae has shown that nuclear encoded factors
are required for translational activation of specific
chloroplast encoded mRNAs (Rochaix et al., Embo J., 8:1013-1021
(1989), Kuchka et al., Cell, 58:869-876 (1989), Girard-Bascou et
al., Embo J., 13:3170-3181 (1994), Kim et al., Plant Mol. Biol.,
127:1537-1545 (1994)).

In the green alga Chlamydomonas reinhardtii, a number of
nuclear mutants have been identified that affect translation of
single specific mRNAs in the chloroplast, often acting at
translation initiation (Yohn et al., supra, (1996)). Mutational
analysis of chloroplast mRNAs has identified sequence elements
within the 5' untranslated region (UTR) of mRNAs that are
required for translational activation (Mayfield et al., supra,
(1995), Mayfield et al., J. Cell Biol., 127:1537-1545 (1994) and
Rochaix, Ann. Rev. Cell Biol., 8:1-28 (1992)), and the 5' UTR of
a chloroplast mRNA can confer a specific translation phenotype
on a reporter gene in vivo (Zerges and Rochaix, Mol. Cell Biol.,
14:5268-5277 (1994) and Staub and Maliga, Embo J., 12:601-606
(1993)).

Putative translational activator proteins were identified
by purifying a complex of four proteins that binds with high
affinity and specificity to the 5' UTR of the chloroplast
encoded psbA mRNA [encoding the D1 protein, a major component of
Photosystem II (PS II)] (Danon and Mayfield, Embo J.,
10:3993-4001 (1991)). Binding of these proteins to the 5' UTR of psbA
mRNA correlates with translation of this mRNA under a variety of
physiological (Danon and Mayfield, id., (1991)) and biochemical
conditions (Danon and Mayfield, Science, 266:1717-1719 (1994)
and Danon and Mayfield, Embo J., 13:2227-2235 (1994)), and in
different genetic backgrounds (Yohn et al., supra, (1996)). The
binding of this complex to the psbA mRNA can be regulated in
vitro in response to both redox potential (Danon and Mayfield,
Science, 266:1717-1719 (1994)) and phosphorylation (Danon and
Mayfield, Embo J., 13:2227-2235 (1994)), both of which are
thought to transduce the light signal to activate translation of
psbA mRNA. The 47 kDa member of the psbA RNA binding complex
(RB47) is in close contact with the RNA, and antisera specific
to this protein inhibit binding to the psbA mRNA in vitro
(Danon and Mayfield, supra, (1991)).

Although the translational control of psbA mRNA by RB47 has
been reported, the protein has not been extensively
characterized and the gene encoding RB47 has not been
identified, cloned and sequenced. In addition, the regulatory
control of the activation of RNA binding activity to the binding
site by nuclear-encoded trans-acting factors, such as RB60, has
not been fully understood. The present invention now describes
the cloning and sequencing of both RB47 and RB60. Based on the
translation regulation mechanisms of RB47 and RB60 with the RB47
binding site, the present invention also describes a translation
regulated expression system for use in both prokaryotes and
eukaryotes.

Brief Description of the Invention

The RB47 gene encoding the RB47 activator protein has now
been cloned and sequenced, and the target binding site for RB47
on messenger RNA (mRNA) has now been identified. In addition, a
regulatory protein disulfide isomerase, a 60 kilodalton protein
referred to as RB60, has also been cloned, sequenced and
characterized. Thus, the present invention is directed to gene 
expression systems in eukaryotic and prokaryotic cells based on 
translational regulation by RB47 protein, its binding site and 
the RB60 regulation of RB47 binding site activation. 

More particularly, the present invention describes the use
of the RB47 binding site, i.e., a 5' untranslated region (UTR)
of the chloroplast psbA gene, in the context of an expression
system for regulating the expression of genes encoding a desired
recombinant molecule. Protein translation is effected by the
combination of the RB47 binding site and the RB47 binding
protein in the presence of protein translation components.
Regulation can be further imposed with the use of the RB60
regulatory protein disulfide isomerase. Therefore, the present
invention describes reagents and expression cassettes for 
controlling gene expression by affecting translation of a coding 
nucleic acid sequence in a cell expression system. 

Thus, in one embodiment, the invention contemplates an RB47
binding site sequence, i.e., an mRNA sequence, typically an mRNA
leader sequence, which contains the RB47 binding site. A
preferred RB47 binding site is psbA mRNA. For use in expressing
recombinant molecules, the RB47 binding site is typically
inserted 5' to the coding region of the preselected molecule to
be expressed. In a preferred embodiment, the RB47 binding site
is inserted into the 5' untranslated region along with an
upstream psbA promoter to drive the expression of a preselected
nucleic acid encoding a desired molecule. In alternative
embodiments, the RB47 binding site is inserted into the
regulatory region downstream of any suitable promoter present in
a eukaryotic or prokaryotic expression vector. Preferably, the
RB47 binding site is positioned within 100 nucleotides of the
translation initiation site. In a further aspect, 3' to the
coding region is a 3' untranslated region (3' UTR) necessary for
transcription termination and RNA processing.

Thus, in a preferred embodiment, the invention contemplates 
an expression cassette or vector that contains a transcription 
unit constructed for expression of a preselected nucleic acid or 
gene such that upon transcription, the resulting mRNA contains 
the RB47 binding site for regulation of the translation of the
preselected gene transcript through the binding of the 
activating RB47 protein. The RB47 protein is provided 
endogenously in a recipient cell and/or is a recombinant protein 
expressed in that cell. 

Thus, the invention also contemplates a nucleic acid 
molecule containing the sequence of the RB47 gene. The nucleic 
acid molecule is preferably in an expression vector capable of 
expressing the gene in a cell for use in interacting with an RB47
binding site. The invention therefore contemplates an expressed
recombinant RB47 protein. In one embodiment, the RB47 binding
site and RB47 encoding nucleotide sequences are provided on the
same genetic element. In alternative embodiments, the RB47
binding site and RB47 encoding nucleotide sequences are provided
separately.

The invention further contemplates a nucleic acid molecule
containing the sequence encoding the 69 kilodalton precursor to
RB47. In alternative embodiments, the RB47 nucleic acid
sequence contains a sequence of nucleotides to encode a
histidine tag. Thus, the invention relates to the use of
recombinant RB47, precursor RB47, and histidine-modified RB47
for enhancing translation of a desired nucleic acid.

The invention further contemplates a nucleic acid molecule 
containing a nucleotide sequence encoding a polypeptide which
regulates the binding of RB47 to the RB47 binding site. A preferred
regulatory molecule is the protein disulfide isomerase RB60.
The RB60-encoding nucleic acid molecule is preferably in an
expression vector capable of expressing the gene in a cell for
use in regulating the interaction of RB47 with an RB47 binding
site. Thus, the invention also contemplates an expressed 
recombinant RB60 protein. In one embodiment, the RB47 binding 
site, RB47 encoding and RB60 encoding nucleotide sequences are 
provided on the same genetic element. In alternative 
embodiments, the expression control nucleotide sequences are 
provided separately. In a further aspect, the RB60 gene and 
RB47 binding site sequence are provided on the same construct. 

The invention can therefore be a cell culture system, an in 
vitro expression system or a whole tissue, preferably a plant, 
in which the transcription unit is present that contains the 
RB47 binding site and further includes (1) a transcription unit
capable of expressing RB47 protein or (2) the endogenous RB47
protein itself for the purpose of enhancing translation of the 
preselected gene having an RB47 binding site in the mRNA. 
Preferred cell culture systems are eukaryotic and prokaryotic 
cells. Particularly preferred cell culture systems include 
plants and more preferably algae. 

A further preferred embodiment includes (1) a separate
transcription unit capable of expressing a regulatory molecule,
preferably RB60 protein, or (2) the endogenous RB60 protein
itself for the purpose of regulating translation of the
preselected gene having an RB47 binding site in the mRNA. In an
alternative preferred embodiment, one transcription unit is
capable of expressing both the RB47 and RB60 proteins. In a
further aspect, the RB47 binding site sequence and RB60 sequence
are provided on the same construct.

In one aspect of the present invention, plant cells
endogenously containing RB47 and RB60 proteins are used for the
expression of recombinant molecules, such as proteins or
polypeptides, through activation of the RB47 binding in an
exogenously supplied expression cassette. Alternatively, stable
plant cell lines containing endogenous RB47 and RB60 are first
generated in which RB47 and/or RB60 proteins are overexpressed.
Overexpression is obtained preferably through the stable
transformation of the plant cell with one or more expression
cassettes encoding recombinant RB47 and RB60. In a further
embodiment, stable cell lines, such as mammalian or bacterial
cell lines, lacking endogenous RB47 and/or RB60 proteins are
created that express exogenous RB47 and/or RB60.

Plants for use with the present invention can be
transgenic plants, or plants in which the genetic elements of
the invention have been introduced. Based on the property of
controlled translation provided by the combined use of the RB47 
protein and the RB47 binding site, translation can be regulated 
for any gene product, and the system can be introduced into any 
plant species. Similarly, the invention is useful for any 
prokaryotic or eukaryotic cell system. 

Methods for the preparation of expression vectors are well
known in the recombinant DNA arts, and for expression in plants
are well known in the transgenic plant arts. These particulars
are not essential to the practice of the invention, and 
therefore will not be considered as limiting. 

The invention allows for a high level of protein synthesis in
plant chloroplasts and in the cytoplasm of both prokaryotic and
eukaryotic cells. Because the chloroplast is such a productive
plant organelle, chloroplasts are a preferred site of
translation by virtue of the large amounts of protein that can
be produced. This aspect provides for great advantages in
agricultural production of mass quantities of a preselected
protein product.

The invention further provides for the ability to screen 
for agonists or antagonists of the binding of RB47 to the RB47 
binding site using the expression systems as described herein. 
Antagonists of the binding are useful in the prevention of plant 
propagation.

Also contemplated by the present invention is a screening 
assay for agonists or antagonists of RB60 in a manner analogous 
to that described above for RB47. Such agonists or antagonists
would be useful in general to modify expression of RB60 as a way 
to regulate cellular processes in a redox manner. 

Kits containing expression cassettes and expression 
systems, along with packaging materials comprising a label with 
instructions for use, as described in the claimed embodiments,
are also contemplated for use in practicing the methods of this
invention.

Other uses will be apparent to one skilled in the art in 
light of the present disclosures. 

Brief Description of Drawings 

In the figures forming a portion of this disclosure: 
Figures 1A-1D show the complete protein amino acid residue
sequence of RB47 from residues 1-623, together with the
corresponding nucleic acid sequence encoding the RB47 sequence,
from base 1 to base 2732. The nucleotide coding region of the
precursor form is shown from base 197-2065. The mature form is from
nucleotide position 197-1402. Also shown are the mRNA leader,
bases 1-196, and poly A tail of the mRNA, bases 2066-2732. Both
the nucleotide and amino acid sequence are listed in SEQ ID NO
5.
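
The base coordinates given for Figures 1A-1D can be checked with simple arithmetic. The following is a minimal sketch in Python; the figure of about 110 daltons per amino acid residue is a common rule of thumb assumed here for illustration and is not taken from this disclosure, and the function names are hypothetical.

```python
# Coordinates quoted for Figures 1A-1D (1-based, inclusive base positions).
CDNA_LENGTH = 2732
LEADER = (1, 196)
PRECURSOR_CDS = (197, 2065)
MATURE_CDS = (197, 1402)
TRAILER = (2066, 2732)

# Assumption: average residue mass of ~110 Da (rule of thumb, not from the text).
AVG_RESIDUE_DA = 110


def span_length(span):
    """Number of bases in an inclusive (start, end) span."""
    start, end = span
    return end - start + 1


def codons(span):
    """Number of whole codons in a span; raises if the span is not a multiple of 3."""
    length = span_length(span)
    if length % 3:
        raise ValueError(f"span {span} is not a whole number of codons")
    return length // 3


def approx_kda(n_residues):
    """Rough molecular mass estimate in kilodaltons."""
    return n_residues * AVG_RESIDUE_DA / 1000


if __name__ == "__main__":
    # The leader, coding region and trailing region should tile the whole cDNA.
    total = sum(span_length(s) for s in (LEADER, PRECURSOR_CDS, TRAILER))
    print("bases accounted for:", total, "of", CDNA_LENGTH)
    for name, span in (("precursor", PRECURSOR_CDS), ("mature", MATURE_CDS)):
        n = codons(span)
        print(f"{name}: {n} codons, roughly {approx_kda(n):.1f} kDa")
```

Under these assumptions the precursor coding region spans 623 codons, in line with residues 1-623 and a precursor of roughly 69 kilodaltons, and the three spans together account for all 2732 bases. The estimate for the mature form is only approximate, since apparent masses measured on gels often differ from values calculated this way.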

Figures 2A-2B show the complete protein amino acid residue
sequence of RB60 from residues 1-488, together with the
corresponding nucleic acid sequence from base 1 to base 2413, of
which bases 16-1614 encode the RB60 sequence. Both the
nucleotide and amino acid sequence are listed in SEQ ID NO 10.

Figures 3A-3C show the complete sequence of the psbA mRNA,
showing both the encoded psbA protein amino acid residue sequence
(residues 1-352) and the nucleic acid sequence as further
described in Example 3. Both the nucleotide and
amino acid sequence are listed in SEQ ID NO 13.

Figure 4 is a schematic diagram of an expression cassette 
containing on one transcription unit from 5' to 3', a promoter
region derived from the psbA gene for encoding the D1 protein
from C. reinhardtii further containing a transcription
initiation site (TS), the RB47 binding site, a region for
insertion of a foreign or heterologous coding region, an RB47
coding region, an RB60 coding region, and the 3' flanking region
containing transcription termination site (TS), flanked by an
origin of replication and selection marker. Restriction
endonuclease sites for facilitating insertion of the independent 
genetic elements are indicated and further described in Example 
4A. 

Figures 5A-5B show the nucleotide and amino acid sequence 
of the RB47 molecule containing a histidine tag, the sequences 
of which are also listed in SEQ ID NO 14. 

Figure 6 is a schematic diagram of an expression cassette 
containing on one transcription unit from 5' to 3', a promoter
region derived from the psbA gene for encoding the D1 protein
from C. reinhardtii further containing a transcription
initiation site (TS), the RB47 binding site, a region for

***** 
RB47 is also shown in Figures 1A-1D (SEQ ID NO 5). As
described in Section 2 above, the predicted protein
sequence from the cloned cDNA contained both the derived
peptide sequences of RB47 and is highly homologous to
poly(A) binding proteins (PABP) from a variety of
eukaryotic organisms.

2. Cloning of RB60

To clone the cDNA encoding the 60 kDa psbA mRNA
binding protein (RB60), the psbA-specific RNA binding
proteins were purified from light-grown C. reinhardtii
cells using heparin-agarose chromatography followed by
psbA RNA affinity chromatography (RAC). RAC-purified
proteins were separated by two-dimensional
polyacrylamide gel electrophoresis. The region
corresponding to RB60 was isolated from the PVDF
membrane. RB60 protein was then digested with trypsin.
Unambiguous amino acid sequences were obtained from two
tryptic peptide fragments (WFVDGELASDYNGPR (SEQ ID NO 6)
and QLILWTTADDLKADAEIMTVFR (SEQ ID NO 7)) as described
above for RB47. The calculated molecular weights of the
two tryptic peptides used for further analysis precisely
matched the molecular weights determined by mass
spectrometry. The DNA sequence corresponding to one
peptide of 22 amino acid residues was amplified by PCR
using degenerate oligonucleotides, the forward primer
5' CGCGGATCCGAYGCBGAGATYATGAC 3' (SEQ ID NO 8) and the
reverse primer 5' CGCGAATTCGTCATRATCTCVGCRTC 3' (SEQ ID NO
9), where R can be A or G (the other IUPAC nucleotides
have been previously defined above). The amplified
sequence was then used to screen a λgt10 cDNA library
from C. reinhardtii. Three clones were identified with
the largest being 2.2 kb. Selection and sequencing were
performed as described for RB47 cDNA.
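
The degenerate primers quoted above can be examined in silico. The following Python sketch expands the IUPAC ambiguity codes, translates each expansion with the standard genetic code, and takes the reverse complement. It rests on two assumptions made only for illustration: that the first nine bases of each primer form a non-templated 5' tail, and that the primer sequences are exactly as printed in this text. The function names are hypothetical.

```python
from itertools import product

# IUPAC nucleotide ambiguity codes and their complements.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}
COMPLEMENT = {
    "A": "T", "C": "G", "G": "C", "T": "A", "R": "Y", "Y": "R",
    "S": "S", "W": "W", "K": "M", "M": "K", "B": "V", "V": "B",
    "D": "H", "H": "D", "N": "N",
}

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Reverse complement of a sequence that may contain IUPAC codes."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def expand(seq):
    """All unambiguous sequences represented by a degenerate sequence."""
    return ["".join(p) for p in product(*(IUPAC[base] for base in seq))]


def translate(seq):
    """Translate whole codons only; trailing partial codons are ignored."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


FORWARD = "CGCGGATCCGAYGCBGAGATYATGAC"   # SEQ ID NO 8 as printed
REVERSE = "CGCGAATTCGTCATRATCTCVGCRTC"   # SEQ ID NO 9 as printed
TAIL = 9  # assumption: first nine bases are a 5' tail carrying a cloning site

fwd_core = FORWARD[TAIL:]
rev_core = REVERSE[TAIL:]

if __name__ == "__main__":
    variants = expand(fwd_core)
    print("forward core variants:", len(variants))
    print("peptides encoded:", {translate(v) for v in variants})
    print("reverse core is reverse complement of forward core:",
          reverse_complement(fwd_core) == rev_core)
```

With the sequences as printed, the forward core expands to twelve variants that all encode the residues DAEIM found in the 22-residue peptide, and the reverse core is the exact reverse complement of the forward core. Bases 4 through 9 of the two tails read GGATCC and GAATTC, which are the recognition sequences of BamHI and EcoRI respectively.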

The resulting RB60 cDNA sequence is available via
GenBank (Accession Number AF027727). The nucleotide and
encoded amino acid sequence of RB60 is also shown in
Figures 2A-2B (SEQ ID NO 10). The protein coding
sequence of 488 amino acid residues corresponds to
nucleotide positions 16-1614 of the 2413 base pair
sequence. The predicted amino acid sequence of the
cloned cDNA contained the complete amino acid sequences
of the two tryptic peptides. The amino acid sequence of
the encoded protein revealed that it has high sequence
homology to both plant and mammalian protein disulfide
isomerase (PDI), and contains the highly conserved
thioredoxin-like domains with -CysGlyHisCys- (-CGHC-)
(SEQ ID NO 11) catalytic sites in both the N-terminal
and C-terminal regions and the -LysAspGluLeu- (-KDEL-)
(SEQ ID NO 12) endoplasmic reticulum (ER) retention
signal at the C-terminus found in all PDIs. PDI is a
multifunctional protein possessing enzymatic activities
for the formation, reduction, and isomerization of
disulfide bonds during protein folding, and is typically
found in the ER. The first 30 amino acid residues of
RB60 were found to lack sequence homology with the
N-terminal signal sequence of PDI from plants or
mammalian cells. However, this region has
characteristics of chloroplast transit peptides of C.
reinhardtii, which have similarities with both
mitochondrial and higher plant chloroplast presequences.
A transit peptide sequence should override the function
of the -KDEL- ER retention signal and target the protein
to the chloroplast since the -KDEL- signal acts only to
retain the transported protein in the ER.

3. Preparation of psbA Promoter Sequence and RB47
Binding Site Nucleotide Sequence
The chloroplast psbA gene from the green
unicellular alga C. reinhardtii was cloned and sequenced
as described by Erickson et al., Embo J., 3:2753-2762
(1984), the disclosure of which is hereby incorporated
by reference. The DNA sequence of the coding regions
and the 5' and 3' untranslated (UTR) flanking sequences
of the C. reinhardtii psbA gene is shown in Figures 3A-3C.
The psbA gene sequence is also available through
GenBank as further discussed in Example 4. The
nucleotide sequence is also listed as SEQ ID NO 13. The
deduced amino acid sequence (also listed in SEQ ID NO
13) of the coding region is shown below each codon
beginning with the first methionine in the open reading
frame. Indicated in the 5' non-coding sequence are a
putative Shine-Dalgarno sequence in the dotted box, two
putative transcription initiation sites determined by S1
mapping (S1) and the Pribnow -10 sequence in the closed
box. Inverted repeats of eight or more base pairs are
marked with arrows and labeled A-D. A direct repeat of
31 base pairs with only two mismatches is marked with
arrows labeled 31. Indicated in the 3' non-coding
sequence is a large inverted repeat marked by a forward
arrow and the S1 cleavage site marking the 3' end of the
mRNA. Both the 5' and 3' untranslated regions are used
in preparing one of the expression cassettes of this
invention as further described below.

The 5' UTR as previously discussed contains both
the psbA promoter and the RB47 binding site. The
nucleotide sequence defining the psbA promoter contains
the region of the psbA DNA involved in binding of RNA
polymerase to initiate transcription. The -10 sequence
component of the psbA promoter is indicated by the boxed
nucleotide sequence upstream of the first S1 site, while the
-35 sequence is located approximately 35 bases before the
putative initiation site. As shown in Figures 3A-3C,
the -10 sequence is boxed, above which is the nucleotide
position (-100) from the first translated codon. The
-35 sequence is determined accordingly. A psbA promoter
for use in an expression cassette of this invention ends
at the first indicated S1 site (nucleotide position -92
as counting from the first ATG) in Figures 3A-3C and
extends to the 5' end (nucleotide position -251 as shown
in Figures 3A-3C). Thus, the promoter region is 160
bases in length. A more preferred promoter region
extends at least 100 nucleotides to the 5' end from the
S1 site. A most preferred region contains nucleotide
sequence ending at the S1 site and extending 5' to
include the -35 sequence, i.e., from -92 to -130 as
counted from the first encoded amino acid residue (39



The psbA RB47 binding site region begins at the
first S1 site as shown in Figures 3A-3C and extends to
the first adenine base of the first encoded methionine
residue. Thus, a psbA RB47 binding site in the psbA
gene corresponds to the nucleotide positions from -91 to
-1 as shown in Figures 3A-3C.

The above-identified regions are used to prepare
expression constructs as described below. The promoter
and RB47 binding site regions can be used separately;
for example, the RB47 binding site sequence can be
isolated and used in a eukaryotic or prokaryotic plasmid
with a non-psbA promoter. Alternatively, the entire
psbA 5' UTR having 251 nucleotides as shown in Figures
3A-3C is used for the regulatory region in an expression
cassette containing both the psbA promoter and RB47
binding site sequence as described below.
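
The region boundaries recited above are given in coordinates counted back from the first ATG, where the base immediately before the A of the ATG is position -1. The following Python sketch shows how such coordinates map to lengths and to slices of an upstream sequence. The function names are hypothetical and the sequence used is a placeholder of the right length, not the psbA sequence of Figures 3A-3C.

```python
# Region boundaries as recited in the text (ATG-relative, inclusive).
REGIONS = {
    "promoter": (-251, -92),
    "most preferred promoter": (-130, -92),
    "RB47 binding site": (-91, -1),
}


def span_length(start, end):
    """Length of an inclusive upstream span; both positions must be negative."""
    if not (start <= end < 0):
        raise ValueError("expected start <= end < 0")
    return end - start + 1


def upstream_slice(upstream, start, end):
    """Slice an upstream sequence whose last base sits at position -1."""
    n = len(upstream)
    if span_length(start, end) > n or -start > n:
        raise ValueError("span extends beyond the supplied sequence")
    return upstream[n + start:n + end + 1]


if __name__ == "__main__":
    # Placeholder sequence (hypothetical), 251 bases long like the psbA 5' UTR.
    placeholder = ("ACGT" * 63)[:251]
    for name, (start, end) in REGIONS.items():
        print(f"{name}: {start} to {end}, {span_length(start, end)} bases")
    promoter = upstream_slice(placeholder, *REGIONS["promoter"])
    site = upstream_slice(placeholder, *REGIONS["RB47 binding site"])
    print("promoter + binding site covers the whole region:",
          promoter + site == placeholder)
```

Under this convention the promoter is 160 bases, the most preferred promoter region is 39 bases, and the RB47 binding site is 91 bases, so that the promoter and binding site together make up the 251 nucleotides of the psbA 5' UTR.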

4. Preparation of Expression Vectors and Expression of
Coding Sequences

A. Constructs Containing a psbA Promoter, an
RB47 Binding Site Nucleotide Sequence, a
Desired Heterologous Coding Sequence, an
RB47-Encoding Sequence and an RB60-Encoding
Sequence

Plasmid expression vector constructs,
alternatively called plasmids, vectors, constructs and
the like, are constructed containing various
combinations of elements of the present invention as
described in the following examples. Variations of the
positioning and operably linking of the genetic elements
described in the present invention and in the examples
below are contemplated for use in practicing the methods
of this invention. Methods for manipulating DNA
elements into operable expression cassettes are well
known in the art of molecular biology. Accordingly,
variations of control elements, such as constitutive or
inducible promoters, with respect to prokaryotic or
eukaryotic expression systems as described in Section C.
are contemplated herein although not enumerated.
Moreover, the expression of the various elements is not
limited to one transcript producing one mRNA; the
invention contemplates protein expression from more than
one transcript if desired.

As such, while the examples below recite one or two
types of expression cassettes, the genetic elements of
RB47 binding site, any desired coding sequence, in
combination with RB47 and RB60 coding sequences along
with a promoter are readily combined in a number of
operably linked permutations depending on the
requirements of the cell system selected for the
expression. For example, for expression in a
chloroplast, endogenous RB47 protein is present;
therefore an expression cassette having an RB47 binding
site and a desired coding sequence is minimally required
along with an operative promoter sequence.
Overexpression of RB47 may be preferable to enhance the
translation of the coding sequence; in that case, the
chloroplast is further transformed with an expression
cassette containing an RB47-encoding sequence. Although
the examples herein and below utilize primarily the
sequence encoding the precursor form of RB47, any of the
RB47-encoding sequences described in the present
invention, i.e., RB47 precursor, mature RB47 and
histidine-modified RB47 are contemplated for use in any
expression cassette and system as described herein. To
regulate the activation of translation, an RB60-encoding
element is provided to the expression system to provide
the ability to regulate redox potential in the cell as
taught in Section B. These examples herein and below
represent a few of the possible permutations of genetic
elements for expression in the methods of this
invention.
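
The choice of genetic elements described above depends on what the host cell already supplies. The following Python sketch captures that reasoning as a simple selection rule. The class, function and option names are hypothetical and introduced only for illustration; the element order follows the 5' to 3' arrangement described for Figure 4, and the example hosts are assumptions based on the statements that chloroplasts contain endogenous RB47 and that bacterial cell lines lack it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Host:
    """A host cell system and whether it supplies RB47 endogenously."""
    name: str
    has_endogenous_rb47: bool


def cassette_elements(host, overexpress_rb47=False, include_rb60=False):
    """List the genetic elements, 5' to 3', for a given host and options."""
    # Minimal requirement: promoter, RB47 binding site, desired coding sequence.
    elements = ["promoter", "RB47 binding site", "desired coding sequence"]
    # RB47 must be supplied if the host lacks it, or may be added to enhance
    # translation when the host already contains it.
    if overexpress_rb47 or not host.has_endogenous_rb47:
        elements.append("RB47 coding sequence")
    # RB60 is added when regulation of translational activation is wanted.
    if include_rb60:
        elements.append("RB60 coding sequence")
    return elements


if __name__ == "__main__":
    chloroplast = Host("C. reinhardtii chloroplast", has_endogenous_rb47=True)
    bacterium = Host("E. coli", has_endogenous_rb47=False)
    print(chloroplast.name, cassette_elements(chloroplast))
    print(chloroplast.name, cassette_elements(chloroplast, overexpress_rb47=True))
    print(bacterium.name, cassette_elements(bacterium, include_rb60=True))
```

The sketch treats each option as independent, which mirrors the statement that the elements can be combined in a number of permutations, and it places all elements on one transcript only for simplicity.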

In one embodiment, a plasmid is constructed
containing an RB47 binding site directly upstream of an
inserted coding region for a heterologous protein of
interest, and the RB47 and RB60 coding regions.
Heterologous refers to the nature of the coding region
being dissimilar and not from the same gene as the
regulatory molecules in the plasmid, such as RB47 and
RB60. Thus, all the genetic elements of the present
invention are produced in one transcript from the
IPTG-inducible psbA promoter. Alternative promoters are
similarly acceptable.

The final construct described herein for use in a
prokaryotic expression system makes a single mRNA from
which all three proteins are translated. The starting
plasmid is any E. coli based plasmid containing an
origin of replication and selectable marker gene. For
this example, the Bluescript plasmid, pBS, commercially
available through Stratagene, Inc., La Jolla, CA, which
contains a polylinker-cloning site and an ampicillin
resistance marker, is selected for the vector.

The wild-type or native psbA gene (Erickson et al.,
Embo J., 3:2753-2762 (1984)), also shown in Figures 3A-3C,
is cloned into pBS at the EcoRI and BamHI sites of
the polylinker. The nucleotide sequence of the psbA
gene is available on GenBank with the 5' UTR and 3' UTR
respectively listed in Accession Numbers X01424 and
X02350. The EcoRI site of psbA is 1.5 kb upstream of
the psbA initiation codon and the BamHI site is 2 kb
downstream of the stop codon. This plasmid is referred
to as pD1.

Using site-directed PCR mutagenesis, well known to
one of ordinary skill in the art, an NdeI site is placed
at the initiation codon of psbA in the pD1 plasmid so
that the ATG of the NdeI restriction site is the ATG
initiation codon. This plasmid is referred to as
pD1/Nde. An NdeI site is then placed at the initiation
codon of the gene encoding the heterologous protein of
interest and an XhoI site is placed directly downstream
(within 10 nucleotides) of the TAA stop codon of the
heterologous protein coding sequence. Again using
site-directed mutagenesis, an XhoI site is placed within
10 nucleotides of the initiation codon of RB47, the
preparation of which is described in Example 2, and a
NotI site is placed directly downstream of the stop
codon of RB47. The heterologous coding region and the
RB47 gene are then ligated into pD1/Nde so that the
heterologous protein gene is directly adjacent to the
RB47 binding site and the RB47 coding region is
downstream of the heterologous coding region, using the
XhoI site at the heterologous stop codon and the NotI
site of the pD1 polylinker.
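
The site placements described above can be checked in silico
before any mutagenesis is attempted. The following is a minimal
sketch, assuming plain uppercase DNA strings. The function names
and the toy sequence are hypothetical and are not part of the
described constructs; the recognition sequences are the standard
ones for NdeI (CATATG), XhoI (CTCGAG), NotI (GCGGCCGC) and XbaI
(TCTAGA).

```python
# Illustrative sketch only: locate restriction sites in a DNA string and
# confirm that an NdeI site supplies the ATG initiation codon.
# The toy sequence and all function names are hypothetical.

RECOGNITION_SITES = {
    "NdeI": "CATATG",
    "XhoI": "CTCGAG",
    "NotI": "GCGGCCGC",
    "XbaI": "TCTAGA",
}


def find_sites(seq, enzyme):
    """Return 0-based start positions of every occurrence of the site."""
    site = RECOGNITION_SITES[enzyme]
    seq = seq.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)
    return positions


def nde_atg_is_start(seq, cds_start):
    """True if the coding region at cds_start begins with the ATG of an NdeI site.

    NdeI recognizes CATATG, so the ATG sits 3 bases into the site.
    """
    return seq.upper()[cds_start - 3:cds_start + 3] == RECOGNITION_SITES["NdeI"]


def xho_within(seq, stop_end, window=10):
    """True if an XhoI site starts within `window` nt after the stop codon."""
    return any(0 <= p - stop_end <= window for p in find_sites(seq, "XhoI"))


# Toy coding region: NdeI site, ATG GCC AAG TAA, a 2-nt gap, then an XhoI site.
toy = "GGCATATGGCCAAGTAAGGCTCGAGTT"
cds_start = toy.index("ATG")        # first base of the initiation codon
stop_end = cds_start + 4 * 3        # first base after the in-frame TAA
print(find_sites(toy, "NdeI"), nde_atg_is_start(toy, cds_start),
      xho_within(toy, stop_end))
```

A check of this kind only confirms where recognition sequences
fall; it does not model primer design or ligation efficiency.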

These genetic manipulations result in a plasmid
containing the 5' end of the psbA gene including the
promoter region and with the RB47 binding site
immediately upstream of a heterologous coding region,
and the RB47 coding region immediately downstream of the
heterologous coding region. The number of nucleotides
between the stop codon of the heterologous coding region
and the initiation codon of the RB47 coding region is
preferably less than 20, and the intervening sequence
preferably does not contain any additional stop codons
in any reading frame. This plasmid is referred to as
pD1/RB47.
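
The two stated preferences for the region between coding
sequences (fewer than 20 nucleotides, and no stop codon in any
reading frame) lend themselves to a simple check. The sketch
below is illustrative only. It assumes that "any reading frame"
means the three forward frames of the transcript, and the
function names are hypothetical.

```python
# Illustrative sketch only: check an intercistronic spacer against the two
# stated preferences (fewer than 20 nt, no stop codon in any reading frame).
# Assumption: "any reading frame" is taken to mean the three forward frames.

STOP_CODONS = {"TAA", "TAG", "TGA"}


def stop_codons_in_spacer(spacer):
    """Return (frame, position, codon) for each stop codon in the 3 forward frames."""
    spacer = spacer.upper()
    hits = []
    for frame in range(3):
        for pos in range(frame, len(spacer) - 2, 3):
            codon = spacer[pos:pos + 3]
            if codon in STOP_CODONS:
                hits.append((frame, pos, codon))
    return hits


def spacer_is_acceptable(spacer, max_len=20):
    """True if the spacer is shorter than max_len and free of stop codons."""
    return len(spacer) < max_len and not stop_codons_in_spacer(spacer)


# A hypothetical 10-nt spacer that contains an XhoI site (CTCGAG).
print(spacer_is_acceptable("GGCTCGAGGC"))
```

The same check applies unchanged to any other spacer between two
coding regions on the same transcript.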
Using site-directed mutagenesis, a NotI site is
placed immediately (within 10 nucleotides) upstream of
the initiation codon of RB60, the preparation of which
is described in Example 2, and an XbaI site is placed
downstream of the RB60 stop codon. This DNA fragment is
then ligated to the 3' end of the psbA gene using the
XbaI site found in the 3' end of the psbA gene so that
the psbA 3' end is downstream of the RB60 coding region.
This fragment is then ligated into the pD1/RB47 plasmid
using the NotI and BamHI sites so that the RB60 coding
region directly follows the RB47 coding region. The
resulting plasmid is designated pD1/RB47/RB60.
Preferably there are fewer than 20 nucleotides between
the RB47 and RB60 coding regions and preferably there
are no stop codons in any reading frame in that region.
The final plasmid thus contains the following genetic
elements operably linked in the 5' to 3' direction: the
5' end of the psbA gene with a promoter capable of
directing transcription in chloroplasts, an RB47 binding
site, a desired heterologous coding region, the RB47
coding region, the RB60 coding region, and the 3' end of
the psbA gene which contains a transcription termination
and mRNA processing site, together with an E. coli
origin of replication and ampicillin resistance gene. A
diagram of this plasmid with the restriction sites is
shown in Figure 4.
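
The element order listed above, together with the restriction
sites introduced in the preceding steps, can be written down as
a simple linear map. The sketch below is an illustrative model
only. The labels are descriptive names chosen for this example,
the vector backbone is omitted, and each site is assumed to
occur once in the insert.

```python
# Illustrative sketch only: a linear map of the insert described in the
# text, alternating restriction sites and genetic elements (5' to 3').
# Labels are descriptive names chosen for this example; the vector backbone
# is omitted and each site is assumed to be unique within the insert.

CONSTRUCT_MAP = [
    ("site", "EcoRI"),
    ("element", "psbA 5' end with promoter and RB47 binding site"),
    ("site", "NdeI"),
    ("element", "heterologous coding region"),
    ("site", "XhoI"),
    ("element", "RB47 coding region"),
    ("site", "NotI"),
    ("element", "RB60 coding region"),
    ("site", "XbaI"),
    ("element", "psbA 3' end"),
    ("site", "BamHI"),
]


def elements_between(construct_map, site_a, site_b):
    """Return the elements lying between two named sites, in 5' to 3' order."""
    i = construct_map.index(("site", site_a))
    j = construct_map.index(("site", site_b))
    if i > j:
        i, j = j, i
    return [name for kind, name in construct_map[i + 1:j] if kind == "element"]


# Which elements would an XhoI/XbaI double digest release?
print(elements_between(CONSTRUCT_MAP, "XhoI", "XbaI"))
```

Under these assumptions, a map of this kind makes it easy to see
which elements a given pair of sites brackets before a digest is
planned.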

Expression of pD1/RB47/RB60 in E. coli to produce
recombinant RB47, RB60 and the recombinant heterologous
protein is performed as described in Example 4B. The
heterologous protein is then purified as further
described.

Expression cassettes in which the sequences
encoding RB47 and RB60 are similarly operably linked to
a heterologous coding sequence having the psbA RB47
binding site as described in Example 3 are prepared with
a different promoter for use in eukaryotic expression
systems, such as mammalian expression systems. In this
aspect, the cassette is prepared as described above with
the exception that restriction cloning sites are
dependent upon the available multiple cloning sites in
the recipient vector. Thus, the RB47 binding site
prepared in Example 3 is prepared for directed ligation
into a selected expression vector downstream of the
promoter in that vector. The RB47 and RB60 coding
sequences are obtained from the pD1/RB47/RB60 plasmid by
digestion with XhoI and XbaI and inserted into a
similarly digested vector if the sites are present.
Alternatively, site-directed mutagenesis is utilized to
create appropriate linkers. A desired heterologous
coding sequence is similarly ligated into the vector for
expression.

B. Constructs Containing RB47 Nucleotide Sequence
1) Purified Recombinant RB47 Protein
In one approach to obtain purified
recombinant RB47 protein, the full-length RB47 cDNA
prepared above was cloned into the E. coli expression
vector pET3A (Studier et al., Methods Enzymol.,
185:60-89 (1990)), also commercially available from
Novagen, Inc., Madison, WI, and transformed into BL21
E. coli cells. The cells were grown to a density of 0.4
(OD600), then induced with 0.5 mM IPTG. Cells were then
allowed to grow for an additional 4 hours, at which
point they were pelleted and frozen.

Confirmation of the identity of the cloned cDNA as
encoding the authentic RB47 protein was accomplished by
examining protein expressed from the cDNA by immunoblot
analysis and by RNA binding activity assay. The
recombinant RB47 protein produced when the RB47 cDNA was
expressed was recognized by antisera raised against the
C. reinhardtii RB47 protein. The E. coli expressed
protein migrated at 80 kDa on SDS-PAGE, but the protein
was actually 69 kDa, as determined by mass spectrometry
of the E. coli expressed protein. This mass agrees with
the mass predicted from the cDNA sequence. A 60 kDa
product was also produced in E. coli and recognized by
the antisera against the C. reinhardtii protein; it
is most likely a degradation or early termination
product of the RB47 cDNA. The recombinant RB47 protein
expressed from the RB47 cDNA is recognized by the
antisera raised against the C. reinhardtii protein at
levels similar to the recognition of the authentic C.
reinhardtii RB47 protein, demonstrating that the cloned
cDNA produces a protein product that is immunologically
related to the naturally produced RB47 protein. In
order to generate a recombinant equivalent of the
endogenous native RB47, the location of the 47 kDa
polypeptide was mapped on the full-length recombinant
protein by comparing mass spectrometric data of tryptic
digests of the C. reinhardtii 47 kDa protein and the
full-length recombinant protein. Thus, peptide mapping
by mass spectrometry has shown that the endogenous RB47
protein corresponds primarily to the RNA binding domains
contained within the N-terminal region of the predicted
precursor protein, suggesting that a cleavage event is
necessary to produce the mature 47 kDa protein. Thus,
full-length recombinant RB47 is 69 kDa and contains a
carboxy domain that is cleaved in vivo to generate the
endogenous mature form of RB47 that is 47 kDa.
To determine if the heterologously expressed RB47
protein was capable of binding the psbA RNA, the E. coli
expressed protein was purified by heparin agarose
chromatography. The recombinant RB47 protein expressed
in E. coli was purified using a protocol similar to that
used previously for purification of RB47 from C.
reinhardtii. Approximately 5 g of E. coli cells grown
as described above were resuspended in low salt
extraction buffer (10 mM Tris [pH 7.5], 10 mM NaCl, 10
mM MgCl2, 5 mM β-mercaptoethanol) and disrupted by
sonication. The soluble cell extract was applied to a 5
mL Econo-Pac heparin cartridge (Bio-Rad) which was
washed prior to elution of the RB47 protein (Danon and
Mayfield, EMBO J., 10:3993-4001 (1991)).
The E. coli expressed protein that bound to the
heparin agarose matrix was eluted from the column at the
same salt concentration as used to elute the authentic
C. reinhardtii RB47 protein. This protein fraction was
used in in vitro binding assays with the psbA 5' UTR.
Both the 69 and 60 kDa E. coli expressed proteins
crosslinked to the radiolabeled psbA 5' UTR at levels
similar to crosslinking of the endogenous RB47 protein
when the RNA/protein complex was subjected to UV
irradiation.

Heparin agarose purified proteins, both from the E.
coli expressed RB47 cDNA and from C. reinhardtii cells,
were used in an RNA gel mobility shift assay to
determine the relative affinity and specificity of these
proteins for the 5' UTR of the psbA mRNA. The E. coli
expressed proteins bound to the psbA 5' UTR in vitro
with properties that are similar to those of the
endogenous RB47 protein purified from C. reinhardtii.
RNA binding to both the E. coli expressed and the
endogenous RB47 protein was competed using either
200-fold excess of unlabeled psbA RNA or 200-fold excess
of poly(A) RNA. RNA binding to either of these proteins
was poorly competed using 200-fold excess of total RNA
or 200-fold excess of the 5' UTR of the psbD or psbC
RNAs. Different forms of the RB47 protein (47 kDa
endogenous protein vs. the 69 kDa E. coli expressed
protein) may account for the slight differences in
mobility observed when comparing the binding profiles of
purified C. reinhardtii protein to heterologously
expressed RB47.

The mature form of RB47 is also produced in
recombinant form by the insertion by PCR of an
artificial stop codon in the RB47 cDNA at nucleotide
positions 1403-1405, resulting in a mature RB47
recombinant protein having 402 amino acids
as shown in Figures 1A-1D. An example of this is shown
in Figures 5A-5B for the production of a recombinant
histidine-modified RB47 mature protein as described
below. The complete RB47 cDNA containing the inserted
stop codon is placed into an expression vector, such as
pET3A as described above, for expression of the mature
47 kDa form of the RB47 protein. In the absence of the
inserted stop codon, the transcript reads through to the
TAA stop codon at nucleotide positions 2066-2068 to
produce the precursor RB47 having the above-described
molecular weight characteristics and 623 amino acid
residues.
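
The residue counts quoted above follow directly from the
nucleotide coordinates. The sketch below is a worked check under
stated assumptions: coordinates are treated as 1-based and
inclusive, the coding region is taken to start at nucleotide 197
(the start position recited in the claims), and 110 Da per
residue is a common rule-of-thumb average that does not come
from this document.

```python
# Illustrative sketch only: relate the nucleotide coordinates given in the
# text and claims to the stated protein lengths.
# Assumptions: coordinates are 1-based and inclusive; the coding region
# starts at nucleotide 197 (as recited in the claims); 110 Da is a
# rule-of-thumb average residue mass, not a value from the source.

CDS_START = 197
MATURE_LAST_CODING_NT = 1402      # artificial stop codon occupies 1403-1405
PRECURSOR_LAST_CODING_NT = 2065   # native TAA stop occupies 2066-2068
AVERAGE_RESIDUE_MASS_DA = 110


def codon_count(first_nt, last_nt):
    """Number of codons in an inclusive, 1-based nucleotide interval."""
    length = last_nt - first_nt + 1
    if length % 3:
        raise ValueError("coding region length is not a multiple of 3")
    return length // 3


def approx_mass_kda(residues):
    """Very rough protein mass estimate in kDa."""
    return residues * AVERAGE_RESIDUE_MASS_DA / 1000


mature_residues = codon_count(CDS_START, MATURE_LAST_CODING_NT)
precursor_residues = codon_count(CDS_START, PRECURSOR_LAST_CODING_NT)
print(mature_residues, precursor_residues)
print(approx_mass_kda(mature_residues), approx_mass_kda(precursor_residues))
```

Under these assumptions the intervals give 402 and 623 codons,
matching the stated lengths. The rough mass estimate for the
precursor lands close to the 69 kDa measured by mass
spectrometry, while the estimate for the mature form is only an
approximation of the nominal 47 kDa.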

Recombinant RB47 is also expressed and purified in
plant cells. For this aspect, C. reinhardtii strains
were grown in complete media (Tris-acetate-phosphate
[TAP]; Harris, The Chlamydomonas Sourcebook, San Diego,
CA, Academic Press (1989)) to a density of 5x10^6
cells/mL under constant light. Cells were harvested by
centrifugation at 4°C for 5 minutes at 4,000 g. Cells
were either used immediately or frozen in liquid N2 for
storage at -70°C.

Recombinant RB47 protein was also produced as a
modified RB47 protein with a histidine tag at the
amino-terminus according to well known expression methods
using pET19-D vectors available from Novagen, Inc.,
Madison, WI. The nucleotide and amino acid sequence of
a recombinant histidine-modified RB47 of the mature 47
kDa form is shown in Figures 5A-5B with the nucleotide
and amino acid sequence also listed in SEQ ID NO 14.
Thus the nucleotide sequence of a histidine-modified
RB47 is 1269 bases in length. The precursor form of the
RB47 protein is similarly obtained in the expression
system, both of which are modified by the presence of a
histidine tag that allows for purification by metal
affinity chromatography.

The recombinant histidine-modified RB47 purified
through addition of a poly-histidine tag followed by
Ni2+ column chromatography showed binding
characteristics similar to those described for
recombinant precursor RB47 above.

C. Constructs Containing RB60 Nucleotide Sequence
In one approach to obtain purified recombinant
RB60 protein, the full-length RB60 cDNA prepared above
was cloned into the E. coli expression vector pET3A
(Studier et al., Methods Enzymol., 185:60-89 (1990)),
also commercially available from Novagen, Inc., Madison,
WI, and transformed into BL21 E. coli cells. The cells
were grown to a density of 0.4 (OD600), then induced with
0.5 mM IPTG. Cells were then allowed to grow for an
additional 4 hours, at which point they were pelleted
and frozen.

Recombinant histidine-modified RB60 was also
expressed with a pET19-D vector as described above for
RB47 that was similarly modified. Purification of the
recombinant RB60 proteins was performed as described for
RB47, thereby producing recombinant RB60 proteins for use
in the present invention.

The RB60 coding sequence is also mutagenized for
directional ligation into a selected vector for
expression in alternative systems, such as mammalian
expression systems.

D. Constructs Containing an RB47-Encoding
Sequence and an RB60-Encoding Sequence

To prepare an expression cassette for encoding
both RB47 and RB60, one approach is to digest plasmid
pD1/RB47/RB60 prepared above with XhoI and XbaI to
isolate the fragment for both encoding sequences. The
fragment is then inserted into a similarly digested
expression vector if available or is further mutagenized
to prepare appropriate restriction sites.

Alternatively, the nucleotide sequences of RB47 and
RB60, as described in Example 2, are separately prepared
for directional ligation into a selected vector.

An additional embodiment of the present invention
is to prepare an expression cassette containing the RB47
binding site along with the coding sequences for RB47
and RB60. The plasmid pD1/RB47/RB60 prepared above is
digested with NdeI and XhoI to prepare an expression
cassette into which any desired coding sequence having
similar restriction sites is directionally ligated.
Expression vectors containing both the RB47 and RB60
encoding sequences in which the RB47 binding site
sequence is utilized with a different promoter are also
prepared as described in Example 4A.



E. Constructs Containing an RB47 Binding Site
Nucleotide Sequence, Insertion Sites for a
Desired Heterologous Coding Sequence, and an
RB47-Encoding Sequence

In another permutation, a plasmid or
expression cassette is constructed containing an RB47
binding site directly upstream of an inserted coding
region for a heterologous protein of interest, and the
RB47 coding region. The final construct described
herein for use in a prokaryotic expression system makes
a single mRNA from which both proteins are translated.

The plasmid referred to as pD1/RB47 is prepared as
described above in Example 4A. A diagram of this
plasmid with the restriction sites is shown in Figure 6.
Expression of pD1/RB47 in E. coli to produce
recombinant RB47 and the recombinant heterologous
protein is performed as described above. The
heterologous protein is then purified as further
described.

To produce an expression cassette that allows for
insertion of an alternative desired coding sequence, the
plasmid pD1/RB47 is digested with NdeI and XhoI,
resulting in a vector having restriction endonuclease
sites for insertion of a desired coding sequence
operably linked to an RB47 binding site and RB47 coding
sequence on one transcriptional unit.
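
The exchange of one coding sequence for another between the two
sites can be pictured at the level of plain sequence strings. The
sketch below is illustrative only. The sequences are toy strings,
the function name is hypothetical, and a string-level model does
not represent the staggered ends that real digests leave.

```python
# Illustrative sketch only: replace the coding region lying between an NdeI
# site and an XhoI site with a new coding region carrying the same flanking
# sites. Sequences are toy strings; real digests leave staggered ends that
# this string-level model does not represent.

NDEI = "CATATG"
XHOI = "CTCGAG"


def swap_insert(plasmid, new_insert):
    """Return plasmid with its NdeI..XhoI segment replaced by new_insert.

    Both arguments must contain exactly one NdeI and one XhoI site;
    new_insert must start with the NdeI site and end with the XhoI site.
    """
    for name, seq in (("plasmid", plasmid), ("new_insert", new_insert)):
        if seq.count(NDEI) != 1 or seq.count(XHOI) != 1:
            raise ValueError(f"{name} must contain exactly one NdeI and one XhoI site")
    if not (new_insert.startswith(NDEI) and new_insert.endswith(XHOI)):
        raise ValueError("new_insert must be flanked by NdeI and XhoI sites")
    start = plasmid.index(NDEI)
    end = plasmid.index(XHOI) + len(XHOI)
    if start > end:
        raise ValueError("NdeI must lie upstream of XhoI")
    return plasmid[:start] + new_insert + plasmid[end:]


# Toy example: swap a short placeholder coding region for another one.
old = "AAAA" + "CATATGGCCTAA" + "GG" + "CTCGAG" + "TTTT"
new = "CATATGAAAGGGTAACC" + "CTCGAG"
print(swap_insert(old, new))
```

Requiring each site to be unique mirrors the practical need for
the chosen enzymes not to cut elsewhere in the vector or insert.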

F. Constructs Containing an RB47 Binding Site
Nucleotide Sequence, Insertion Sites for a
Desired Heterologous Coding Sequence, and an
RB60-Encoding Sequence

In another permutation, a plasmid or
expression cassette is constructed containing an RB47
binding site directly upstream of an inserted coding
region for a heterologous protein of interest, and the
RB60 coding region. The final construct described
herein for use in a prokaryotic expression system makes
a single mRNA from which both proteins are translated.
In this embodiment, a separate construct encoding
recombinant RB47 as described in Example 4B is
co-transformed into the E. coli host cell for expression.

The plasmid referred to as pD1/RB60 is prepared as
described above for pD1/RB47 in Example 4A with the
exception that XhoI and XbaI sites are created on RB60
rather than RB47.

Expression of pD1/RB60 in E. coli to produce
recombinant RB60 and the recombinant heterologous
protein is performed as described above with the
combined expression of RB47 from a separate expression
cassette. The heterologous protein is then purified as
further described.

To produce an expression cassette that allows for
insertion of an alternative desired coding sequence, the
plasmid pD1/RB60 is digested with NdeI and XhoI,
resulting in a vector having restriction endonuclease
sites for insertion of a desired coding sequence
operably linked to an RB47 binding site and RB60 coding
sequence on one transcriptional unit.

G. Constructs Containing RB47 Binding Site
Nucleotide Sequence and Heterologous Coding
Sequences

1) Expression of Recombinant Tetanus Toxin
Single Chain Antibody

The examples herein describe constructs
that are variations of those described above. The
constructs described below contain an RB47 binding site
sequence and a heterologous coding sequence. The
activating protein RB47 was endogenously provided in the
chloroplast and/or plant cell. In other aspects, however,
as taught by the methods of the present invention, the
chloroplast is further transformed with an
RB47-expression construct as described above for
overexpression of RB47 to enhance translation
capacities.

A strain of the green alga Chlamydomonas
reinhardtii was designed to allow expression of a single
chain antibody gene in the chloroplast. The
transgenically expressed antibody was produced from a
chimeric gene containing the promoter and 5'
untranslated region (UTR) of the chloroplast psbA gene
prepared as described above, followed by the coding
region of a single chain antibody (encoding a tetanus
toxin binding antibody), and then the 3' UTR of the psbA
gene, also prepared as described above, to provide for
transcription termination and RNA processing signals.
This construct is essentially pD1/Nde including a
heterologous coding sequence having a 3' XbaI
restriction site for ligation with the 3' psbA gene and
is diagrammed in Figure 7.

The psbA-single chain construct was first
transformed into the C. reinhardtii chloroplast and
transformants were then screened for single chain gene
integration. Transformation of the chloroplast was
performed via biolistic delivery as described in US
Patents 5,545,818 and 5,553,878, the disclosures of
which are hereby incorporated by reference.
Transformation is accomplished by homologous
recombination via the 5' and 3' UTR of the psbA mRNA.

As shown in Figure 8, two of the transformants that
contained the single chain chimeric gene produced single
chain antibodies at approximately 1% of total protein
levels. The transgenic antibodies were of the correct
size and were completely soluble, as would be expected
of a correctly folded protein. Few degradation
products were detectable by this Western analysis,
suggesting that the proteins were fairly stable within
the chloroplast. To determine whether the produced
antibody retained the binding capacity for tetanus
toxin, ELISA assays were performed using a mouse-produced
Fab, from the original tetanus toxin antibody, as the
control. The chloroplast single chain antibody bound
tetanus toxin at levels similar to Fab, indicating that
the single chain antibody produced in C. reinhardtii is a
fully functional antibody. These results clearly
demonstrate the ability of the chloroplast to synthesize
and accumulate functional antibody molecules resulting
from the translational activation of an RB47 binding
site in an expression cassette by endogenous RB47
protein in the chloroplast.



2) Expression of Bacterial Luciferase Enzyme
Having Two Subunits

For the production of molecules that
contain more than one subunit, such as dIgA and
bacterial luciferase enzyme, several proteins must be
produced in stoichiometric quantities within the
chloroplast. Chloroplasts have an advantage for this
type of production over cytoplasmic protein synthesis in
that translation of multiple proteins can originate from
a single mRNA. For example, a dicistronic mRNA having
5' and 3' NdeI and XbaI restriction sites and containing
both the A and B chains of the bacterial luciferase
enzyme was inserted downstream of the psbA promoter and
5' UTR of the pD1/Nde construct prepared in Example 4A
above. In this construct, the bacterial LuxAB coding
region was ligated between the psbA 5' UTR and the psbA
3' end in an E. coli plasmid that was then transformed
into Chlamydomonas reinhardtii cells as described above
for expression in the chloroplast. A schematic of the
construct is shown in Figure 9. Single transformant
colonies were then isolated. A plate containing a
single isolate was grown for 10 days on complete media
and a drop of the luciferase substrate n-decyl aldehyde
was placed on the plate and the luciferase visualized by
video-photography in a dark chamber. Both proteins were
synthesized from this single mRNA and luciferase
activity accumulated within the chloroplast as shown in
Figure 10. Some mRNAs within plastids encode as many
as 5 separate proteins on a single mRNA.

3) Expression of Dimeric IgA

To generate dimeric IgA, the construct
shown in Figure 11 is engineered so that the psbA
promoter and 5' UTR are used to drive the synthesis of
the light chain and heavy chains of an antibody, and the
J chain normally associated with IgA molecules. The
nucleic acid sequences for the dimeric IgA are inserted
into the RB47 binding site construct prepared in Example
4A. The construct is then transformed into C.
reinhardtii cells as previously described for expression
of the recombinant dIgA.

Production of these three proteins within the
plastid allows for the self assembly of a dimeric IgA
(dIgA). Production of this complex is monitored in
several ways. First, Southern analysis of transgenic
algae is used to identify strains containing the
polycistronic chimeric dIgA gene. Strains positive for
integration of the dIgA gene are screened by Northern
analysis to ensure that the chimeric mRNA is
accumulating. Western blot analysis using denaturing
gels is used to monitor the accumulation of the
individual light, heavy and J chain proteins, and
Western blot analysis using native gels is used to
monitor the accumulation of the assembled dIgA molecule.

By using a single polycistronic mRNA in the context
of RB47 regulated translation, two of the potential
pitfalls in the assembly of multimeric dIgA molecules are
overcome. First, this construct ensures approximately
stoichiometric synthesis of the subunits, as ribosomes
reading through the first protein are likely to continue
to read through the second and third proteins as well.
Second, all of the subunits are synthesized in close
physical proximity to each other, which increases the
probability of the proteins self assembling into a
multimeric molecule. Following the production of a
strain producing dIgA molecules, dIgA is produced on an
intermediate scale by growing algae in 300 liter
fermentors. Production is then scaled up further.

The foregoing specification, including the specific
embodiments and examples, is intended to be illustrative
of the present invention and is not to be taken as
limiting. Numerous other variations and modifications
can be effected without departing from the true spirit
and scope of the invention.
What Is Claimed Is:

1. An expression cassette for expression of a
desired molecule, which cassette comprises:

a) an RB47 binding site nucleotide sequence
upstream of a restriction endonuclease site for
insertion of a desired coding sequence to be expressed;
and

b) a nucleotide sequence encoding a
polypeptide which binds RB47 binding site.

2. The expression cassette of claim 1 further
comprising a promoter sequence operably linked to and
positioned upstream of the RB47 binding site nucleotide
sequence.

3. The expression cassette of claim 2 wherein the
promoter sequence is derived from a psbA gene.

4. The expression cassette of claim 3 wherein the
coding sequence is heterologous to the psbA gene.

5. The expression cassette of claim 1 wherein the
cassette comprises a plasmid or virus.

6. The expression cassette of claim 1 further
comprising and operably linked thereto a nucleotide
sequence encoding RB60.

7. The expression cassette of claim 1 wherein the
RB47 binding polypeptide is selected from the group
consisting of RB47, RB47 precursor and a
histidine-modified RB47.

8. An expression cassette for expression of a
desired molecule, which cassette comprises:

a) an RB47 binding site nucleotide sequence
upstream of a restriction endonuclease site for
insertion of a desired coding sequence to be expressed;
and

b) a nucleotide sequence encoding a
polypeptide which regulates the binding of RB47 to the
RB47 binding site.

9. The expression cassette of claim 8 wherein the
regulatory polypeptide is RB60.

10. A recombinant RB47 protein.

11. A recombinant RB60 protein.

12. An isolated nucleotide sequence encoding RB47.

13. An isolated nucleotide sequence encoding a
histidine-modified RB47.

14. An isolated nucleotide sequence encoding RB47
precursor.

15. The nucleotide sequence of claim 12 from
nucleotide position 197 to 1402 in Figures 1A-1B and SEQ
ID NO 5.

16. The nucleotide sequence of claim 13 from
nucleotide position 1 to 1269 in Figures 5A-5B and SEQ
ID NO 14.

17. The nucleotide sequence of claim 14 from
nucleotide position 197 to 2065 in Figures 1A-1C
and SEQ ID NO 5.

18. An expression cassette comprising the
nucleotide sequence of claim 12, 13 or 14.

19. An isolated nucleotide sequence encoding RB60.

20. The nucleotide sequence of claim 18 from
nucleotide position 16 to 1614 in Figures 2A-2B and SEQ
ID NO 10.

21. An expression cassette comprising the
nucleotide sequence of claim 19.

22. An expression system comprising a cell
transformed with the expression cassette of claim 1.

23. The expression system of claim 22 wherein the
cell is a plant cell.

24. The expression system of claim 23 wherein the
plant cell endogenously expresses RB47.

25. The expression system of claim 23 wherein the
plant cell endogenously expresses RB60.

26. The expression system of claim 23 wherein the
plant cell endogenously expresses RB47 and RB60.

27. The expression system of claim 22 wherein the
cell is a eukaryotic cell.

28. The expression system of claim 22 wherein the
cell is a prokaryotic cell.

29. The expression system of claim 22 further
comprising the expression cassette of claim 21.

30. An expression system comprising a cell
transformed with the expression cassette of claim 8.

31. The expression system of claim 29 further
comprising the expression cassette of claim 18.

32. A cell stably transformed with the expression
cassette of claim 18.

33. A cell stably transformed with the expression
cassette of claim 21.

34. A cell stably transformed with the expression
cassette of claims 18 and 21.

35. The expression cassette of claim 1 further
comprising an inserted desired coding sequence.

36. An expression system comprising a cell
transformed with the expression cassette of claim 35,
wherein the coding sequence is expressed forming the
desired molecule upon activation of the RB47 binding
site with RB47.



37. The expression system of claim 36 wherein the
cell is a plant cell endogenously expressing RB47.

38. The expression system of claim 36 wherein the
cell is stably transformed with the expression cassette
of claim 21.

39. An expression system comprising a cell
transformed with an expression cassette comprising a
promoter sequence, a RB47 binding site sequence, a
desired coding sequence for a molecule, and a nucleotide
sequence for encoding a polypeptide which binds RB47
binding site, wherein all sequences are operably linked.

40. A method of preparing a desired recombinant
molecule wherein the method comprises cultivating the
expression system of claim 36.

41. A method of preparing a desired recombinant
molecule wherein the method comprises cultivating the
expression system of claim 39.

42. A method for expressing a desired coding
sequence comprising:

a) forming an expression cassette by
operably linking:

1) a promoter sequence;

2) a RB47 binding site sequence;

3) a desired coding sequence; and

4) a nucleotide sequence encoding a
polypeptide which binds RB47 binding site; and

b) introducing the expression cassette into
a cell.

43. The method of claim 42 wherein the cell is a
plant cell endogenously expressing RB47.

44. The method of claim 42 wherein the cell is a
plant cell endogenously expressing RB60.

45. The method of claim 42 further comprising
inducing expression with a promoter inducer molecule.

46. The method of claim 45 wherein the promoter
inducer molecule is IPTG.

47. The method of claim 42 wherein the cell is
transformed with the expression cassette of claim 21.

48. A method for expressing a desired coding
sequence comprising:

a) forming an expression cassette by
operably linking:

1) a promoter sequence;

2) a RB47 binding site sequence; and

3) a desired coding sequence;

and

b) introducing the expression cassette into
a plant cell endogenously expressing RB47.

49. The method of claim 48 wherein the expression
cassette further comprises a nucleotide sequence
encoding RB60.

50. A method for the regulated production of a
recombinant molecule from a desired coding sequence in a
cell, wherein the cell contains the expression cassette
of claim 34, wherein expression of the coding sequence
is activated by RB47 binding to the RB47 binding site
thereby producing the recombinant molecule.

51. A method of forming an expression cassette by 
operably linking: 

a) a RB47 binding site sequence;

b) a cloning site for insertion of a desired
coding sequence downstream of the RB47 binding site
sequence; and

c) a nucleotide sequence encoding a 
polypeptide which binds the RB47 binding site. 

52. The method of claim 51 further comprising a 
promoter sequence operably linked upstream to the RB47 
binding site sequence. 

53. The method of claim 51 further comprising a 
desired coding sequence inserted into the insertion 
site.

54. A method of screening for agonists or
antagonists of RB47 binding to RB47 binding site, the
method comprising the steps:

a) providing a cell expression system
containing:

1) a promoter sequence;

2) a RB47 binding site sequence;

3) a coding sequence for an indicator
polypeptide; and

4) a polypeptide which binds to the
RB47 binding site sequence;

b) introducing an antagonist or agonist into
the cell; and

c) detecting the amount of indicator
polypeptide expressed in the cell.

55. A method of screening for agonists or 
antagonists of RB60 in regulating RB47 binding to RB47 
binding site, the method comprising the steps: 

a) providing an expression system in a cell 
containing: 

1) a promoter sequence;

2) a RB47 binding site sequence;

3) a coding sequence for an indicator
polypeptide;

4) a polypeptide which binds to the
RB47 binding site sequence; and

5) a RB60 polypeptide;

b) introducing an agonist or antagonist into
the cell; and

c) detecting the amount of indicator
polypeptide expressed in the cell.

56. An article of manufacture comprising a
packaging material and contained therein in a separate
container the expression cassette of claim 1, wherein
the expression cassette is useful for expression of a
desired coding sequence, and wherein the packaging
material comprises a label which indicates that the
expression cassette can be used for expressing a desired
coding sequence when the RB47 binding site is activated
by RB47.

57. The article of manufacture of claim 56 further
comprising in a separate container the expression
cassette of claim 18.

58. The article of manufacture of claim 56 further
comprising in a separate container the expression
cassette of claim 21.

59. An article of manufacture comprising a
packaging material and contained therein in a separate
container the expression system of claim 22, wherein the
expression system is useful for expression of a desired
coding sequence, and wherein the packaging material
comprises a label which indicates that the expression
system can be used for expressing a desired coding
sequence when the RB47 binding site is activated by
RB47.

60. An article of manufacture comprising a
packaging material and contained therein in a separate
container the stably transformed cell of claim 32,
wherein the cell is useful as an expression system, and
wherein the packaging material comprises a label which
indicates that the expression system can be used for
expressing a desired coding sequence when the RB47
binding site is activated by RB47.

61. An article of manufacture comprising a
packaging material and contained therein in a separate
container the stably transformed cell of claim 33,
wherein the cell is useful as an expression system, and
wherein the packaging material comprises a label which
indicates that the expression system can be used for
expressing a desired coding sequence when the RB47
binding site is activated by RB47 and regulated by RB60.

62. An article of manufacture comprising a
packaging material and contained therein in a separate
container the stably transformed cell of claim 34,
wherein the cell is useful as an expression system, and
wherein the packaging material comprises a label which
indicates that the expression system can be used for
expressing a desired coding sequence when the RB47
binding site is activated by RB47 and regulated by RB60.

63. An article of manufacture comprising a
packaging material and contained therein in a separate
container the expression cassette of claim 2, wherein
the expression cassette is useful for expression of a
RNA transcript, and wherein the packaging material
comprises a label which indicates that the expression
cassette can be used for producing in vitro a RNA
transcript when the RB47 binding site is activated by
RB47.

64. The article of manufacture of claim 63 wherein
the promoter sequence is selected from the group 
consisting of T3 and T7 promoters. 

65. The article of manufacture of claim 63 further
comprising in separate containers a polymerase, a buffer
and each of four ribonucleotides, reagents for in vitro
RNA transcription.

I HEREBY DECLARE THAT:

My residence, post office address, and citizenship are as stated next to my name in PART 
A of page 2 hereof. 

I believe I am the original, first, and sole inventor (if only one name is listed) or 
an original, first, and joint inventor (if plural names are listed) of the subject 
matter which is claimed and for which a patent is sought on the invention entitled RNA 
BINDING PROTEIN AND BINDING SITE USEFUL FOR EXPRESSION OF RECOMBINANT MOLECULES the 
specification of which: 

is attached hereto 

X was filed on January 16, 1998, as Application Serial No. PCT/US98/00840
and was amended on (if applicable).

I hereby state that I have reviewed and understand the contents of the above-identified
specification, including the claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose information which is material to the examination of 
this application in accordance with Title 37, Code of Federal Regulations, Sec. 1.56(a).

I hereby claim foreign priority benefits under Title 35, United States Code, Sec. 119 
of any foreign application(s) for patent or inventor's certificate listed in PART B on
page 2 hereof and have also identified in PART B on page 2 hereof any foreign
application for patent or inventor's certificate having a filing date before that of the 
application on which priority is claimed. 

I hereby claim the benefit under Title 35, United States Code, Sec. 120 of any United
States application(s) listed in PART C on page 2 hereof and, insofar as the subject
matter of each of the claims of this application is not disclosed in the prior United
States application in the manner provided by the first paragraph of Title 35, United
States Code, Sec. 112, I acknowledge the duty to disclose material information as
defined in Title 37, Code of Federal Regulations, Sec. 1.56(a) which occurred between
the filing date of the prior application and the national or PCT international filing
date of this application.

I hereby declare that all statements made herein of my knowledge are true and that all
statements made on information and belief are believed to be true; and further that 
these statements were made with the knowledge that willful false statements and the like 
so made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 
of the United States Code and that such willful false statements may jeopardize the 
validity of the application or any patent issued thereon. 

I hereby appoint the following as my attorneys or agents with full power of substitution 
to prosecute this application and transact all business in the United States Patent and 
Trademark Office connected therewith: 

Douglas A. Bingham Reg. No. 32,457 Thomas E. Northrup Reg. No. 33,268

Thomas Fitting Reg. No. 34,163 Emily Holmes Reg. No. 40,652

Donald G. Lewis Reg. No. 28,636

whose mailing address for this application is:

THE SCRIPPS RESEARCH INSTITUTE
10550 North Torrey Pines Road, Mail Drop TPC-8
La Jolla, California 92037

Inventor's Signature:

Full name of SECOND joint inventor, if any

Full name of THIRD joint inventor, if any

Full name of FOURTH joint inventor, if any

Fourth Inventor's Signature

PART B: Prior Foreign Application(s)

Serial No. Country Day/Month/Year Filed Priority Claimed

PART C: Claim For Benefit of Filing Date of Earlier U.S. Application(s)

Serial No. Filing Date Status: